Optical character recognition: an illustrated guide to the frontier
نویسندگان
چکیده
We offer a perspective on the performance of current OCR systems by illustrating and explaining actual OCR errors made by three commercial devices. After discussing briefly the character recognition abilities of humans and computers, we present illustrated examples of recognition errors. The top level of our taxonomy of the causes of errors consists of Imaging Defects, Similar Symbols, Punctuation, and Typography. The analysis of a series of "snippets" from this perspective provides insight into the strengths and weaknesses of current systems, and perhaps a road map to future progress. The examples were drawn from the large-scale tests conducted by the authors at the Information Science Research Institute of the University of Nevada, Las Vegas. By way of conclusion, we point to possible approaches for improving the accuracy of today's systems. The talk is based on our eponymous monograph, recently published in The Kluwer International Series in Engineering and Computer Science, Kluwer Academic Publishers, 1999.
منابع مشابه
Intelligent Systems for Off-Line Handwritten Character Recognition: A Review
Handwritten character recognition is always a frontier area of research in the field of pattern recognition and image processing and there is a large demand for Optical Character Recognition on hand written documents. This paper provides a comprehensive review of existing works in handwritten character recognition based on soft computing technique during the past decade. KeywordsHandwritten Cha...
متن کاملEvolutionary Computing Techniques in Off-Line Handwritten Character Recognition: A Review
Handwritten character recognition is always a frontier area of research in the field of pattern recognition and image processing and there is a large demand for OCR on hand written documents. This paper provides a comprehensive review of existing works in handwritten character recognition based on Evolutionary computing technique during the past decade. KeywordsHandwritten Character recognition...
متن کاملHandwritten Character Recognition of South Indian Scripts: A Review
Handwritten character recognition is always a frontier area of research in the field of pattern recognition and image processing and there is a large demand for OCR on hand written documents. Even though, sufficient studies have performed in foreign scripts like Chinese, Japanese and Arabic characters, only a very few work can be traced for handwritten character recognition of Indian scripts es...
متن کاملA Survey on Offline Recognition of South Indian Scripts
Handwritten character recognition is always a frontier area of research in the field of pattern recognition. Even though, sufficient studies have performed in foreign scripts like Arabic, Chinese and Japanese, only a very few work can be traced for handwritten character recognition mainly for the south Indian scripts. Multiple combinations of vowels and consonants along with its modifiers led t...
متن کاملOffline Handwritten Gurmukhi Character Recognition using Particle Swarm Optimized Neural Network
The offline handwritten character recognition is the frontier area of research from last few decades in pattern recognition. It is difficult to recognize handwritten characters as compared to printed characters because of the varying writing styles of individuals. The massive work has been done in languages like Devnagri and Chinese character recognition. The area of Gurmukhi character recognit...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000